Computational protein profile similarity screening for quantitative mass spectrometry experiments
نویسندگان
چکیده
MOTIVATION The qualitative and quantitative characterization of protein abundance profiles over a series of time points or a set of environmental conditions is becoming increasingly important. Using isobaric mass tagging experiments, mass spectrometry-based quantitative proteomics deliver accurate peptide abundance profiles for relative quantitation. Associated data analysis workflows need to provide tailored statistical treatment that (i) takes the correlation structure of the normalized peptide abundance profiles into account and (ii) allows inference of protein-level similarity. We introduce a suitable distance measure for relative abundance profiles, derive a statistical test for equality and propose a protein-level representation of peptide-level measurements. This yields a workflow that delivers a similarity ranking of protein abundance profiles with respect to a defined reference. All procedures have in common that they operate based on the true correlation structure that underlies the measurements. This optimizes power and delivers more intuitive and efficient results than existing methods that do not take these circumstances into account. RESULTS We use protein profile similarity screening to identify candidate proteins whose abundances are post-transcriptionally controlled by the Anaphase Promoting Complex/Cyclosome (APC/C), a specific E3 ubiquitin ligase that is a master regulator of the cell cycle. Results are compared with an established protein correlation profiling method. The proposed procedure yields a 50.9-fold enrichment of co-regulated protein candidates and a 2.5-fold improvement over the previous method. AVAILABILITY A MATLAB toolbox is available from http://hci.iwr.uni-heidelberg.de/mip/proteomics.
منابع مشابه
Computational Protein Coregulation Screening for Quantitative Mass Spectrometry Experiments
Motivation: The characterization of enzyme substrate specificity is a key step towards understanding signal transduction and protein interaction in cellular pathways. Exhaustive manual identification and biochemical validation of enzyme-substrate relationships is not feasible. Screening procedures that use quantitative protein reporter ion trace information to identify or computationally enrich...
متن کاملComputational and Statistical Methods for Protein Quantification by Mass Spectrometry
The definitive introduction to data analysis in quantitative proteomicsThis book provides all the necessary knowledge about mass spectrometry based proteomics methods and computational and statistical approaches to pursue the planning, design and analysis of quantitative proteomics experiments. The authorвЂTMs carefully constructed approach allows readers to easily make the transition into the ...
متن کاملComputational and informatics strategies for identification of specific protein interaction partners in affinity purification mass spectrometry experiments.
Analysis of protein interaction networks and protein complexes using affinity purification and mass spectrometry (AP/MS) is among most commonly used and successful applications of proteomics technologies. One of the foremost challenges of AP/MS data is a large number of false-positive protein interactions present in unfiltered data sets. Here we review computational and informatics strategies f...
متن کاملAssessing Bias in Experiment Design for Large Scale Mass Spectrometry-based Quantitative Proteomics*□S
Mass spectrometry-based proteomics holds great promise as a discovery tool for biomarker candidates in the early detection of diseases. Recently much emphasis has been placed upon producing highly reliable data for quantitative profiling for which highly reproducible methodologies are indispensable. The main problems that affect experimental reproducibility stem from variations introduced by sa...
متن کاملComputational and Statistical Analysis of Protein Mass Spectrometry Data
High-throughput proteomics experiments involving tandem mass spectrometry produce large volumes of complex data that require sophisticated computational analyses. As such, the field offers many challenges for computational biologists. In this article, we briefly introduce some of the core computational and statistical problems in the field and then describe a variety of outstanding problems tha...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Bioinformatics
دوره 26 1 شماره
صفحات -
تاریخ انتشار 2010